(54) NOVEL VEGF-LIKE FACTORS

(57) A novel human gene having significant homology to the VEGF-C gene, a member of the VEGF
family, has been isolated by PCR using primers designed from the sequence of an EST that is
assumed to be homologous to the C-terminal region of the VEGF-C gene. Mouse and rat genes have
been isolated based on the human gene thus isolated. A protein encoded by the human gene has
been isolated by introducing the gene into Escherichia coli and expressing it. The isolated
protein and genes can be applied to, for example, gene therapy for VEGF-D deficiency, wound
healing, and promotion of collateral vessel formation. Furthermore, VEGF-D protein inhibitors
can be used as novel anticancer drugs, etc.
Description

Technical Field 

[0001] The present invention relates to a protein factor involved in angiogenesis in humans and falls in the field of
genetic engineering.

Background Art 

[0002] The process of angiogenesis, in which endothelial cells in the inner wall of the blood vessels of animals
generate new blood vessels, is triggered by transduction of a specific signal. A variety of substances are reportedly
involved in this signal transduction. The most notable among them is vascular endothelial growth factor
(VEGF). VEGF is a protein factor that has been isolated and purified, and it can increase the proliferation of endothelial
cells and the permeability of blood vessels (Senger, D. R. et al., Science 219: 983-985 (1983); Ferrara, N. and Henzel, W. J.,
Biochem. Biophys. Res. Commun. 161: 851-858 (1989)). It has been reported that the human VEGF gene contains
eight exons and produces four subtypes consisting of 121, 165, 189, or 206 amino acid residues, depending on
differences in splicing, which result in different secretion patterns (Houck, K. A. et al., Mol. Endocrinol. 5: 1806-1814 (1991)).
It has also been reported that there is a VEGF-specific receptor, flt-1, and that the binding of VEGF to flt-1 is important
for the signal transduction (Vries, C. D. et al., Science 255: 989-991 (1992)).

[0003] Placental growth factor (PlGF) and platelet-derived growth factor (PDGF), which are related to VEGF, have thus
far been isolated. These factors have been found to promote the proliferation of vascular endothelial cells
(Maglione, D. et al., Proc. Natl. Acad. Sci. USA 88: 9267-9271 (1991); Betsholtz, C. et al., Nature 320: 695-699 (1986)). In
addition, VEGF-B (Olofsson, B. et al., Proc. Natl. Acad. Sci. USA 93: 2576-2581 (1996)) and VEGF-C (Lee, J. et
al., Proc. Natl. Acad. Sci. USA 93: 1988-1992 (1996); Joukov, V. et al., EMBO J. 15: 290-299 (1996)) have recently been
isolated.

[0004] These factors appear to constitute a family, which may contain additional unknown factors.
[0005] It has been suggested that VEGF is involved not only in vascular formation at the developmental stage but also
in the pathological neovascularization associated with diabetes, rheumatoid arthritis, retinopathy, and the growth of
solid tumors. Furthermore, in addition to the vascular endothelial cell growth-promoting effects described above, the
ability of VEGF to increase vascular permeability has been suggested to be involved in the edema formation resulting
from various causes. These VEGF family factors may also act not only on the blood vessels but also on the blood cells
and the lymphatic vessels. They may thus play a role in the differentiation and proliferation of blood cells and the
formation of lymphatic vessels. Consequently, the VEGF family factors are presently drawing considerable attention as a
basis for developing useful, novel drugs.


Disclosure of the Invention 

[0006] An objective of the present invention is to isolate a novel protein belonging to the VEGF family and a gene
encoding the protein. We searched the Expressed Sequence Tags (EST) and Sequence Tagged Sites (STS) in the GenBank
database for genes having homology to VEGF-C, a recently cloned VEGF family gene. As a
result, we found an EST that was assumed to have homology to the C-terminal portion of VEGF-C. We then designed
primers based on the sequence, and amplified and isolated the corresponding cDNA using the 5' RACE method and
the 3' RACE method. The nucleotide sequence of the isolated cDNA was determined, and the amino acid sequence
deduced from it had significant homology to that of VEGF-C. Based on this
homology, we have assumed that the isolated human clone is a fourth member of the VEGF family (hereinafter
designated as VEGF-D). We have also succeeded in expressing the protein encoded by the isolated human VEGF-D gene
in E. coli cells, and in purifying and isolating it. Furthermore, we have succeeded in isolating the mouse and rat
VEGF-D genes using the isolated human VEGF-D gene.

[0007] In particular, the present invention relates to a novel protein belonging to the VEGF family and a gene encoding
the protein. More specifically, it relates to

(1) A protein shown by SEQ ID NO. 1 or having the amino acid sequence derived therefrom in which one or more
amino acids are substituted, deleted, or added;

(2) A protein encoded by a DNA that hybridizes with the DNA shown by SEQ ID NO. 2;

(3) A DNA encoding the protein of (1);

(4) A DNA hybridizing with the DNA shown by SEQ ID NO. 2;

(5) A vector containing the DNA of (3) or (4);

(6) A transformant carrying the vector of (5);

(7) A method of producing the protein of (1) or (2), which comprises culturing the transformant of (6);

(8) An antibody binding to the protein of (1) or (2);

(9) A method of screening a compound binding to the protein of (1) or (2), which comprises a step of detecting the
activity of the protein of (1) or (2) to bind to a test sample; and

(10) A compound binding to the protein of (1) or (2), wherein said compound has been isolated by the method of (9).

[0008] The protein of the present invention (VEGF-D) has significant homology to VEGF-C and can be considered to
be a fourth factor of the VEGF family. Since the major function of VEGF is vascular formation at the developmental
stage and VEGF is considered to be involved in the pathological neovascularization associated with diabetes,
rheumatoid arthritis, retinopathy, and the growth of solid tumors, the protein of the present invention is thought to
have similar functions.

[0009] A person skilled in the art could prepare functionally equivalent proteins through modification of VEGF-D of the
present invention by adding, deleting, or substituting one or more of the amino acids of VEGF-D shown by SEQ ID NO.
1 using known methods. Modifications of the protein can also occur naturally, in addition to the artificial modifications
described above. These modified proteins are also included in the present invention. Known methods for adding,
deleting, or substituting amino acids include the overlap extension polymerase chain reaction (OE-PCR) method (Gene,
1989, 77(1): 51).

[0010] The DNA encoding VEGF-D of the present invention, shown by SEQ ID NO. 2, is useful for isolating DNAs
encoding proteins having functions similar to those of VEGF-D in other organisms. For example, a person skilled in the art
could routinely isolate homologs of human VEGF-D of the present invention from other organisms by allowing the DNA
shown by SEQ ID NO. 2, or part thereof, as a probe, to hybridize with the DNA derived from other organisms. The DNA
that hybridizes with the DNA shown by SEQ ID NO. 2 is also included in the present invention. The other organisms
include mice, rats, and rabbits.

[0011] The DNA encoding a protein that is functionally equivalent to VEGF-D usually has high homology to the DNA
shown by SEQ ID NO. 2. High homology as used herein means sequence homology of 70% or higher, more preferably
80% or higher, and still more preferably 90% or higher.
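The three thresholds above can be illustrated with a short calculation. The following is a minimal sketch, assuming that "homology" is approximated by percent identity over two sequences that have already been aligned to equal length without gaps; the text does not specify how homology is computed, and the function names and the 10-nucleotide fragments are hypothetical rather than taken from SEQ ID NO. 2.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


def homology_tier(identity_percent: float) -> str:
    """Map an identity percentage onto the three tiers named in the text."""
    if identity_percent >= 90.0:
        return "90% or higher"
    if identity_percent >= 80.0:
        return "80% or higher"
    if identity_percent >= 70.0:
        return "70% or higher"
    return "below 70%"


if __name__ == "__main__":
    # Hypothetical aligned fragments that differ at two of ten positions.
    fragment_a = "ATGGCCTGAA"
    fragment_b = "ATGGCGTGAT"
    identity = percent_identity(fragment_a, fragment_b)
    print(f"{identity:.1f}% identity: {homology_tier(identity)}")
```

A real comparison would first align the sequences with gaps allowed; this sketch only covers the counting step once an alignment exists.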

[0012] An example of the hybridization conditions for isolating the DNA having high homology is given below.
Prehybridization is performed in ExpressHyb Solution at 68°C for 30 minutes. The probe labeled with a radioisotope is
denatured at 95°C to 100°C for 2 to 5 minutes and rapidly chilled on ice. The probe is added to a new ExpressHyb
Solution. The blot is transferred to the solution containing the probe and allowed to hybridize under a temperature gradient
of 68°C to 55°C for 2 hours. The blot is washed four times, for 10 minutes each, with a 2 x SSC solution containing 0.05%
SDS at room temperature. The blot is then washed with a 0.1 x SSC solution containing 0.1% SDS at 45°C for 3
minutes. The blot is subjected to autoradiography.

[0013] An example of the hybridization conditions for isolating the DNA having very high homology is given below.
Prehybridization is performed in ExpressHyb Solution at 68°C for 30 minutes. The probe labeled with a radioisotope is
denatured at 95°C to 100°C for 2 to 5 minutes and rapidly chilled on ice. The probe is added to a new ExpressHyb
Solution. The blot is transferred to the solution containing the probe and allowed to hybridize at 68°C for 1 hour. The
blot is washed four times, for 10 minutes each, with a 2 x SSC solution containing 0.05% SDS at room temperature.
The blot is then washed with a 0.1 x SSC solution containing 0.1% SDS at 50°C for 40 minutes, during which the
solution is replaced once. The blot is then subjected to autoradiography.

[0014] Note that the hybridization conditions can vary depending on the length of the probe (whether it is an oligomer
or a probe with more than several hundred bases), the labeling method (whether the probe is radioisotopically labeled
or non-radioisotopically labeled), and the type of the target gene to be cloned. A person skilled in the art would properly
select suitable hybridization conditions. In the present invention, it is especially desirable that the conditions do not
allow the probe to hybridize with the DNA encoding VEGF-C.

[0015] The DNA of the present invention is also used to produce VEGF-D of the present invention as a recombinant
protein. Specifically, the recombinant protein can be produced in large quantity by incorporating the DNA encoding
VEGF-D (for example, the DNA shown by SEQ ID NO. 2) into a suitable expression vector, introducing the resulting
vector into a host, and culturing the transformant to allow the recombinant protein to be expressed.

[0016] The vector to be used for producing the recombinant protein is not particularly restricted. However, vectors
such as pGEMEX-1 (Promega) or pEF-BOS (Nucleic Acids Res. 1990, 18(17): 5322) are preferable. Suitable
examples of the host into which the vector is introduced include E. coli cells, CHO cells, and COS cells.
[0017] The VEGF-D protein expressed by the transformant can be purified by suitably combining purification
treatments such as solubilization with a homogenizer or a sonicator, extraction by various buffers, solubilization or
precipitation by acid or alkali, extraction or precipitation with organic solvents, salting out by ammonium sulfate and other
agents, dialysis, ultrafiltration using membrane filters, gel filtration, ion exchange chromatography, reversed-phase
chromatography, counter-current distribution chromatography, high-performance liquid chromatography, isoelectric
focusing, gel electrophoresis, or affinity chromatography in which antibodies or receptors are immobilized.
[0018] Once the recombinant protein is obtained, antibodies against it can be prepared using known methods. The
known methods include preparing polyclonal antibodies by immunizing rabbits, sheep, or other animals with the purified
protein, and preparing monoclonal antibodies from the antibody-producing cells of immunized mice or rats. These
antibodies will make it possible to quantify VEGF-D. Although the antibodies thus obtained can be used as they are, it will be
more effective to use humanized antibodies to reduce the immunogenicity. The methods of humanizing the
antibodies include the CDR graft method and the method of directly producing a human antibody. In the CDR graft method,
the antibody gene is cloned from the monoclonal antibody-producing cells and its antigenic determinant portion is
transplanted into an existing human antibody. In the method of directly producing a human antibody, a mouse whose immune
system has been replaced by the human immune system is immunized in the same way as for ordinary monoclonal
antibodies. The VEGF-D protein or its antibody thus obtained can be administered into the body by subcutaneous injection
or a similar method.

[0019] A person skilled in the art could screen compounds that bind to the protein of the present invention by known
methods.

[0020] For example, such compounds can be obtained by making a cDNA library on a phage vector (such as λgt11
and ZAP) from the cells expected to express the protein that binds to the protein of the present invention (such as lung,
small intestine, and heart cells of mammals), expressing the cDNAs on LB-agarose, fixing the expressed proteins onto
a filter, preparing the purified protein of the present invention as a biotin-labeled protein or a fusion protein with the GST
protein, and reacting this protein with the above filter. The desired compounds could then be detected by west-western
blotting using streptavidin or an anti-GST antibody (Skolnik, E. Y., Margolis, B., Mohammadi, M., Lowenstein, E., Fischer, R.,
Drepps, A., Ullrich, A., and Schlessinger, J. (1991) Cloning of PI3 kinase-associated p85 utilizing a novel method for
expression/cloning of target proteins for receptor tyrosine kinases. Cell 65: 83-90). Another method comprises the
following steps. First, express the protein of the present invention fused with the SRF binding domain or the GAL4 binding
domain in yeast cells. Second, prepare a cDNA library which expresses cDNAs fused with the transcription activation
domain of VP16 or GAL4 from the cells expected to express a protein that binds to the protein of the present invention.
Third, introduce the cDNA into the above yeast cells. Fourth, isolate the library-derived cDNA from the positive clones.
Finally, introduce the isolated cDNA into E. coli to allow it to be expressed. (When a protein that binds to the protein of
the present invention is expressed in yeast cells, the reporter gene is activated and the positive clone can be detected.)
This method can be performed using a two-hybrid system (MATCHMAKER Two-Hybrid System, Mammalian
MATCHMAKER Two-Hybrid Assay Kit, or MATCHMAKER One-Hybrid System (all by Clontech), or the HybriZAP Two-Hybrid
Vector System (Stratagene)) (Dalton, S. and Treisman, R. (1992) Characterization of SAP-1, a protein recruited by
serum response factor to the c-fos serum response element, Cell 68: 597-612). Alternatively, the binding proteins can
be screened by preparing a cDNA library from the cells expected to express a substance, such as a receptor, which
binds to the protein of the present invention (for example, vascular endothelial cells, bone marrow cells, or lymph duct
cells), introducing it into cells such as COS cells, detecting the binding of the protein of the present invention, either by
itself or labeled with a radioisotope or a fluorescent label, and cloning proteins that bind to the protein of the present invention
(Yamasaki, K., Taga, T., Hirata, Y., Yawata, H., Kawanishi, Y., Seed, B., Taniguchi, T., Hirano, T., and Kishimoto, T. (1988)
Cloning and expression of human interleukin-6 (BSF-2/IFN beta 2) receptor, Science 241: 825-828; Fukunaga, R.,
Ishizaka-Ikeda, E., Seto, Y., and Nagata, S. (1990) Expression cloning of a receptor for murine granulocyte
colony-stimulating factor, Cell 61: 341-350). Still another method comprises applying the culture supernatant or the cellular extract
of the cells expected to express a protein that binds to the protein of the present invention onto an affinity column to
which the protein of the present invention has been immobilized, and purifying the proteins specifically bound to the
column. In addition, a DNA encoding the protein that binds to the protein of the present invention can be obtained by
determining the amino acid sequence of the binding protein, synthesizing oligonucleotides based on the sequence, and
screening a cDNA library with the oligonucleotides as probes.

[0021] Furthermore, compounds that bind to the protein of the present invention can be screened by contacting
compounds, a natural substance bank, or a random phage peptide display library with the immobilized protein of the present
invention and detecting the molecules bound to the protein. These compounds can also be screened by high-throughput
screening utilizing combinatorial chemistry technology (Wrighton, N. C., Farrell, F. X., Chang, R., Kashyap, A. K.,
Barbone, F. P., Mulcahy, L. S., Johnson, D. L., Barrett, R. W., Jolliffe, L. K., and Dower, W. J., Small peptides as potent
mimetics of the protein hormone erythropoietin, Science (United States) Jul 26 1996, 273: 458-464; Verdine, G. L., The
combinatorial chemistry of nature, Nature (England) Nov 7 1996, 384: 11-13; Hogan, J. C. Jr., Directed combinatorial
chemistry, Nature (England) Nov 7 1996, 384: 17-19).

[0022] VEGF-D of the present invention may be used for gene therapy by introducing the VEGF-D gene into the body
of a patient with VEGF-D deficiency, or by expressing the gene in the body. An antisense DNA of the VEGF-D gene
may also be used to inhibit the expression of the gene itself, thereby suppressing pathological neovascularization.
[0023] Among the many available methods for introducing the VEGF-D gene or its antisense DNA into the body, the
retrovirus method, the liposome method, the cationic liposome method, and the adenovirus method are preferable.
[0024] In order to express these genes in the body, the genes can be incorporated into a suitable vector and
introduced into the body by the retrovirus method, the liposome method, the cationic liposome method, or the adenovirus
method. Although the vectors to be used are not particularly limited, such vectors as pAdexIcw and pZIPneo are pref- 
erable. 

[0025] The present invention may also be applied to diagnosing disorders caused by abnormalities of the VEGF-D
gene, for example, by using PCR to detect an abnormality in the nucleotide sequence of the VEGF-D gene.
[0026] Furthermore, according to the present invention, the VEGF-D protein or its agonists can be used to heal
wounds, promote collateral vessel formation, and aid hematopoiesis by the hematopoietic stem cells, by taking
advantage of the angiogenic effect of the VEGF-D protein. The antibodies against the VEGF-D protein or its antagonists can
be used as therapeutic agents for pathological neovascularization, lymphatic dysplasia, dyshematopoiesis, or
edemas arising from various causes. The anti-VEGF-D antibodies can be used for diagnosing diseases resulting from
abnormal production of VEGF-D by quantifying VEGF-D.

Brief Description of the Drawings 


[0027] 

Figure 1 shows the relationship among the VEGF-D gene, the EST sequences, and the primers used for cloning.
Figure 2 compares the amino acid sequences of EST (H24828) and VEGF-C.
Figure 3 compares the amino acid sequences deduced from the VEGF-D gene and from the known genes of the
VEGF family proteins.

Figure 4a shows the hydrophobicity plot of VEGF-D. Figure 4b shows the prediction of the cleavage site of the
VEGF-D signal peptide.

Best Mode for Implementing the Invention

[0028] The following examples illustrate the present invention in detail, but are not to be construed to limit the scope
of the invention.

Example 1. Homology search by TFASTA method

[0029] The sequence CGPNKELDENTCQCVC (SEQ ID NO. 3) was designed based on the consensus sequence
found in the BR3P (Balbiani ring 3 protein) repeat at the C-terminus of VEGF-C. All the EST and STS sequences
in the GenBank database (as of 29 February 1996) were then searched by the TFASTA method (Pearson and Lipman,
Proc. Natl. Acad. Sci. USA 85: 2444-2448 (1988)). The search conditions used are shown below (Table 1).



Table 1

Sequences                392,210
Symbols                  135,585,305
Word Size                2
Gap creation penalty     12.0
Gap extension penalty    4.0

[0030] As a result, an EST (Accession No. H24828) that is considered to encode the consensus sequence was found.
The sequence is one of the ESTs registered by The WashU-Merck EST Project, and nine out of 16 amino acid residues
were identical. A further search of UniGene at NCBI based on this sequence revealed that five registered sequences
(T64149, H24780, H24633, H24828, and T64277 (as of 1 March 1996)), including the above EST, were considered to
be derived from the same gene. T64277 and T64149, as well as H24828 and H24780, are combinations of the 5'
sequence and the 3' sequence of the same clones, and the length of the insert in both of these clones was 0.9 kb (Fig.
1).

[0031] Translating the H24828 sequence into a protein sequence in a frame where homology is found suggested that
this sequence encodes 104 C-terminal amino acid residues. When this amino acid sequence was compared with the
C-terminus of VEGF-C, 28 out of 104 amino acids (27%) were identical. Moreover, the amino acids that are important for
maintaining the protein structure, such as cysteine and proline, were well conserved (Fig. 2). Conserved sequences are
shown in a black box.



Example 2. cDNA cloning from a library



[0032] Primers for 5' RACE and 3' RACE (5' RACE primer: 5'-AGGGATGGGGAACTTGGAACGCTGAAT-3' (SEQ ID
NO. 4), 3' RACE primer: 5'-GATCTAATCCAGCACCCCAAAAACTGC-3' (SEQ ID NO. 5)) were designed (Fig. 1). A
double-stranded cDNA was synthesized from human lung-derived polyA+ RNA using reverse transcriptase. PCR was then
performed using Marathon-Ready cDNA, Lung (Clontech), which has an adapter cDNA ligated to both ends, as the template
cDNA, and using the above primers and the adapter primer (AP-1 primer) as primers. The adapter cDNA contains the
regions to which the adapter primers AP-1 and AP-2 hybridize. The PCR consisted of an initial treatment at 94°C for
1 min; five cycles of 94°C for 30 sec and 72°C for 4 min; five
cycles of 94°C for 30 sec and 70°C for 4 min; then 25 cycles of 94°C for 20 sec and 68°C
for 4 min. (TaKaRa Ex Taq (Takara Shuzo) and the attached buffer were used as the Taq polymerase instead of Advantage
KlenTaq Polymerase Mix.) As a result, 1.5 kb fragments were amplified at the 5' region and 0.9 kb fragments at the 3'
region. These fragments were cloned with the pCR-Direct Cloning System (Clontech), the PCR-TRAP Cloning System
(GenHunter), and the pT7Blue-T vector (Novagen). When the 5' RACE fragment was cloned into the pCR-Direct vector, the
fragment was amplified again using 5'-CTGGTTCGGCCCAGAACTTGGAACGCTGAATCA-3' (SEQ ID NO. 7) and
5'-CTCGCTCGCCCACTAATACGACTCACTATAGG-3' (SEQ ID NO. 8) as primers.
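The stepped annealing/extension temperatures of this PCR (72°C, then 70°C, then 68°C) are easier to check when written as data. The following is a minimal sketch that encodes the program as stated in the text and totals its cycles and programmed hold time; the class and function names are hypothetical, and ramp times between temperatures, which depend on the instrument, are not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in degrees Celsius
    seconds: int    # programmed hold time


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple


# Program as described in the text: initial denaturation, then three stages
# whose combined annealing/extension temperature drops by 2 degrees each time.
RACE_PROGRAM = (
    Stage(1, (Step(94, 60),)),
    Stage(5, (Step(94, 30), Step(72, 240))),
    Stage(5, (Step(94, 30), Step(70, 240))),
    Stage(25, (Step(94, 20), Step(68, 240))),
)


def amplification_cycles(program) -> int:
    """Count cycles in stages that pair denaturation with an extension step."""
    return sum(stage.cycles for stage in program if len(stage.steps) > 1)


def hold_seconds(program) -> int:
    """Total programmed hold time, excluding temperature ramps."""
    return sum(
        stage.cycles * sum(step.seconds for step in stage.steps)
        for stage in program
    )


if __name__ == "__main__":
    total = hold_seconds(RACE_PROGRAM)
    print(f"{amplification_cycles(RACE_PROGRAM)} cycles, "
          f"{total} s ({total / 60:.1f} min) of programmed holds")
```

Keeping the program as a tuple of stages makes it simple to compare against a thermal cycler's printed protocol before a run.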



Example 3. Nucleotide sequence analysis



[0033] An ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit with AmpliTaq DNA Polymerase FS and a 377
DNA Sequencer (ABI) were used for DNA sequencing. The primers used were the primers in the vectors
(5'-AATTAACCCTCACTAAAGGG-3' (SEQ ID NO. 9), 5'-CCAGGGTTTTCCCAGTCACGAC-3' (SEQ ID NO. 10)), the AP-2 primer
(5'-ACTCACTATAGGGCTCGAGCGGC-3' (SEQ ID NO. 11)), and the 10 primers within the sequence shown below (Table 2).



Table 2

SQ1 (SEQ ID NO. 12)     5'-AAGTCTGGAGACCTGCT-3'
SQ2 (SEQ ID NO. 13)     5'-CAGCAGGTCTCCAGACT-3'
SQ3 (SEQ ID NO. 14)     5'-CGCACCCAAGGAATGGA-3'
SQ4 (SEQ ID NO. 15)     5'-TGACACCTGGCCATTCCA-3'
SQ5 (SEQ ID NO. 16)     5'-CATCAGATGGTAGTTCAT-3'
SQ6 (SEQ ID NO. 17)     5'-ATGCTGAGCGAGAGTCCATA-3'
SQ7 (SEQ ID NO. 18)     5'-CACTAGGTTTGCGGCAACTT-3'
SQ8 (SEQ ID NO. 19)     5'-GCTGTTGGCAAGCACTTACA-3'
SQ9 (SEQ ID NO. 20)     5'-GATCCATCCAGATCCCTGAA-3'
SQ10 (SEQ ID NO. 21)    5'-CAGATCAGGGCTGCTTCTA-3'
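Sequencing primers such as those in Table 2 are often characterized by length, GC content, and a rough melting temperature. The sketch below applies the Wallace rule, Tm = 2(A+T) + 4(G+C), to SQ1 and SQ6; the text does not state how these primers were designed, so this is an illustrative calculation only, the rule is a coarse approximation for short oligonucleotides, and the function names are hypothetical.

```python
def base_counts(primer: str) -> dict:
    """Count A, C, G, and T in a primer; reject any other character."""
    primer = primer.upper()
    invalid = set(primer) - set("ACGT")
    if invalid or not primer:
        raise ValueError(f"primer must be non-empty and contain only A, C, G, T: {primer!r}")
    return {base: primer.count(base) for base in "ACGT"}


def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    counts = base_counts(primer)
    return (counts["G"] + counts["C"]) / len(primer)


def wallace_tm(primer: str) -> int:
    """Wallace-rule melting temperature estimate in degrees Celsius."""
    counts = base_counts(primer)
    return 2 * (counts["A"] + counts["T"]) + 4 * (counts["G"] + counts["C"])


if __name__ == "__main__":
    primers = {
        "SQ1": "AAGTCTGGAGACCTGCT",
        "SQ6": "ATGCTGAGCGAGAGTCCATA",
    }
    for name, sequence in primers.items():
        print(f"{name}: {len(sequence)} nt, GC {gc_fraction(sequence):.2f}, "
              f"Tm about {wallace_tm(sequence)} C")
```

Nearest-neighbor thermodynamic models give better estimates when salt and primer concentrations are known.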



[0034] Determining the nucleotide sequences of the 1.5 kb fragment at the 5'-side and the 0.9 kb fragment at the 3'-side
revealed that the sequence of the overlapping region was identical, confirming that the 5'- and 3'-side cDNAs of the desired
gene were obtained. Determining the entire nucleotide sequence of the cDNA revealed that this novel gene has a full
length of 2 kb and can encode a protein consisting of 354 amino acid residues (SEQ ID NO. 1 and SEQ ID NO. 2). Figure
1 shows the relationship between this gene and the EST sequences registered in the GenBank database. Comparing the
amino acid sequence with those of other VEGF family proteins revealed that the amino acids that are well conserved among
family proteins are also conserved in this novel gene, and therefore this gene is clearly a new member of the VEGF
family (Fig. 3). In Fig. 3, HSVEGF indicates human VEGF; HSVEGF-D, HSVEGF-C, and HSVEGF-B indicate human
VEGF homologues (human VEGF-D, human VEGF-C, and human VEGF-B, respectively); HSPDGF-A indicates
human PDGF-A; HSPDGF-B indicates human PDGF-B; and HSPlGF2 indicates human PlGF2. The conserved
sequences are shown in a black box. Since VEGF-D is highly homologous to VEGF-C, which was cloned as the Flt4
ligand, it was presumed to be a ligand for a Flt-4-like receptor.

[0035] Deducing the signal peptide cleavage site (Fig. 4b) by the hydrophobicity plot (Fig. 4a) and the method of von
Heijne (von Heijne, G., Nucleic Acids Res. 14: 4683-4690 (1986)) suggested that the N-terminal 21 amino acid residues may be
cleaved as a signal peptide, and that the protein may also undergo additional processing like VEGF-C.
Example 4. Northern blot analysis 

[0036] A 1 kb fragment, which had been cut out by digestion with EcoRI from the 5'-fragment subcloned into the
pCR-Direct vector, was labeled with [α-32P]dCTP and used as a probe. Labeling was performed by random priming using
Ready-To-Go DNA Labeling Beads (Pharmacia). Hybridization was performed in ExpressHyb Hybridization Solution
(Clontech) by the usual method using Multiple Tissue Northern (MTN) Blot-Human, Human II, Human Fetal, and
Human Cell Lines (Clontech). Significant expression was observed in lung, heart, and intestine. Weak expression was
observed in skeletal muscle, ovary, colon, and pancreas. The apparent size of the mRNA was 2.2 kb, and
the cloned fragment seemed to be almost the full length of the gene.

Example 5. VEGF-D protein expression in E. coli 

[0037] Two primers, 5'-TCCAGATCTTTTGCGGCAACTTTGTATGACAT-3' (SEQ ID NO. 22) and
5'-CAGGTCGACTCAAACAGGCACTAATTCAGGTAC-3' (SEQ ID NO. 23), were synthesized to amplify the region corresponding to the
89th to 181st amino acid residues of human VEGF-D cDNA. The DNA fragment thus obtained was digested with
restriction enzymes BglII and SalI, and ligated using Ligation Kit II (Takara Shuzo Co., Ltd.) to plasmid pQE42 (QIAGEN),
which had been digested with restriction enzymes BamHI and SalI. The resulting plasmid was introduced into E. coli
SG19003[pREP4] (QIAGEN), and a plasmid that was obtained as designed without any mutation was selected
(pQE42-BS3). Plasmid pQE42-BS3 was introduced into E. coli BL21 (Invitrogen) and cultured in 10 ml of L Broth
containing 100 mg/l bicucilline (ampicillin sodium for injection, Meiji Seika Kaisha, Ltd.). 200 ml of fresh L Broth was then
inoculated with the culture. After incubation at 37°C for 1.5 hours, IPTG was added to 3 mM, and the culture was further
incubated at 37°C for 5 hours. After the cells were harvested, a protein was purified with a Ni-NTA column following the
protocol of the QIAexpress Type II kit.

Example 6. Expression of DHFR-VEGF-D fusion protein in E. coli 

[0038] The region corresponding to the 89th to 181st amino acid residues of human VEGF-D cDNA was amplified with
the same primers used in Example 5. The DNA fragment thus obtained was digested with restriction enzymes BglII and
SalI. The fragment was then ligated using Ligation Kit II (Takara Shuzo Co., Ltd.) to the plasmid pQE40 (QIAGEN), which
had been digested with restriction enzymes BamHI and SalI. The resulting plasmid was introduced into E. coli
SG19003[pREP4] (QIAGEN), and a plasmid that was obtained as designed without any mutation was selected
(pQE40-BS3). Plasmid pQE40-BS3 was introduced into E. coli BL21 (Invitrogen) and cultured in 10 ml of L Broth
containing 100 mg/l bicucilline (ampicillin sodium for injection, Meiji Seika Kaisha, Ltd.). 200 ml of fresh L Broth was then
inoculated with the culture. After incubation at 37°C for 1.5 hours, IPTG was added to 3 mM, and the culture was further
incubated at 37°C for 5 hours. After the cells were harvested, a DHFR-VEGF-D fusion protein was purified with a Ni-NTA
column following the protocol of the QIAexpress Type II kit.

Example 7. Cloning mouse VEGF-D cDNA

[0039] Two Hybond-N+ (Amersham) filters (20 cm x 22 cm), onto which 1.5 x 10^ pfu of Mouse lung 5'-stretch cDNA
library had been transferred, were prepared. Gradient hybridization from 68°C to 55°C was performed for 2 hours in
ExpressHyb Hybridization Solution (Clontech) using as a probe approximately 50 ng of a Pvu II fragment of human
VEGF-D, which had been labeled with α-32P-dCTP (Amersham) using Ready-To-Go DNA Labeling Beads (-dCTP)
(Pharmacia). The filters were washed four times in 2 x SSC, 0.05% SDS at room temperature for 10 min, then washed
in 0.1 x SSC, 0.1% SDS at 45°C for 3 min. The washed filters were exposed overnight at -80°C using HyperFilm MP
(Amersham) and intensifying paper. Positive clones were subjected to a second screening in the same manner as
above to isolate a single clone. Isolated lambda DNAs were purified from the plate lysate using a QIAGEN Lambda
MAXI Kit (Qiagen). Insert DNAs were cut out with EcoRI and subcloned into pUC118 EcoRI/BAP (Takara Shuzo Co.,
Ltd.). The nucleotide sequence was then determined with an ABI377 sequencer (Perkin Elmer). The cDNA encoding the full
length of mouse VEGF-D was reconstructed from two of the obtained clones that overlapped each other. SEQ ID NO.
24 shows the nucleotide sequence of mouse VEGF-D cDNA and the amino acid sequence deduced from it.

Example 8. Cloning rat VEGF-D cDNA

[0040] Two Hybond-N+ (Amersham) filters (20 cm x 22 cm), onto which 1.5 x 10^ pfu of Rat lung 5'-stretch cDNA library
had been transferred, were prepared. Gradient hybridization from 68°C to 55°C was performed for 2 hours in
ExpressHyb Hybridization Solution (Clontech) using as a probe approximately 1 µg of a fragment containing 1-782 bp
of the mouse VEGF-D cDNA, which had been labeled with α-32P-dCTP (Amersham) using Ready-To-Go DNA Labeling
Beads (-dCTP) (Pharmacia). The filters were washed four times in 2 x SSC, 0.05% SDS at room temperature for 10 min,
then washed in 0.1 x SSC, 0.1% SDS at 45°C for 3 min. The washed filters were exposed overnight at -80°C using
HyperFilm MP (Amersham) and intensifying paper. Positive clones were subjected to a second screening in the same
manner as above to isolate a single clone. The isolated positive clone was excised into pBluescript using E. coli SOLR
(Stratagene) and helper phage ExAssist (Stratagene), and the sequence was then determined with an ABI377 sequencer
(Perkin Elmer). The sequence seemed to be the rat VEGF-D cDNA but did not contain the termination codon.
[0041] To obtain the C-terminal cDNA which had not been obtained, PCR was performed using Marathon-Ready rat
kidney cDNA (Clontech) as a template and 5' primer GCTGCGAGTGTGTCTGTAAA (SEQ ID NO. 26) and 3' primer
GGGTAGTGGGCAACAGTGACAGCAA (SEQ ID NO. 27) with 40 cycles of 94°C for 15 sec, 55°C for 30 sec, and 72°C
for 2 min. After the thus-obtained fragment was subcloned into pGEM-T vector (Promega), the nucleotide sequence
was determined with an ABI377 sequencer (Perkin Elmer). The resulting clone contained the C-terminus of rat VEGF-D.
Based on the results of sequencing the clone obtained by plaque hybridization and the clone obtained by PCR, the full
length of the rat VEGF-D sequence was determined. SEQ ID NO. 25 shows the determined nucleotide sequence and
the amino acid sequence deduced therefrom.
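
A 3' primer anneals to the sense strand, so its reverse complement is what appears in a listed cDNA sequence. The Python sketch below computes the reverse complement of the 3' primer as given for SEQ ID NO. 27 in the sequence listing and looks for it in nucleotides 1230-1259 of the mouse cDNA (SEQ ID NO. 24). The helper name `reverse_complement` and the choice of that 30-nucleotide window are assumptions made for illustration. That the primer was designed from the mouse 3' untranslated region is an inference from this match, not something the source states.

```python
# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


primer_3 = "GGGTAGTGGGCAACAGTGACAGCAA"        # SEQ ID NO. 27, from the sequence listing
mouse_utr = "CAAGTTGCTGTCACTGTTGCCCACTACCCC"  # SEQ ID NO. 24, nucleotides 1230-1259

site = reverse_complement(primer_3)
offset = mouse_utr.find(site)  # 0-based offset within the window, -1 if absent
print(site, offset)
```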

Industrial Applicability

[0042] In the present invention, a novel protein (VEGF-D) having significant homology to VEGF-C, and its gene, have
been isolated. VEGF-D appears to be involved in the pathological neovascularization associated with diabetes,
rheumatoid arthritis, the growth of solid tumors, differentiation and proliferation of blood cells, formation of lymphatic vessels,
and formation of edema resulting from various causes, as well as in the normal neovascularization at the developmental
stage. The gene of the present invention can be used to diagnose disorders caused by abnormalities of the VEGF-D
gene and for gene therapy for VEGF-D deficiency. The VEGF-D protein, which is obtained by expressing the gene of
the present invention, can be used for healing wounds, promoting collateral vessel formation, and aiding hematopoietic
stem cell proliferation. Antibodies or inhibitors against the VEGF-D protein can be used for treating angiodysplasia
and lymphangiodysplasia associated with inflammation, edemas arising from various causes, dyshematopoiesis, and,
as a novel anticancer agent, for treating pathological neovascularization. The VEGF-D protein and its antibodies can be
useful for diagnosing diseases resulting from abnormal production of VEGF-D.
Sequence Listing

(1) Name or appellation of Applicant: Chugai Research Institute for
Molecular Medicine, Inc.

(2) Title of the Invention: Novel VEGF-like Factor 

(3) Reference Number: C1-802PCT 

(4) Application Number: 

(5) Filing date: 

(6) Country where the priority application was filed and the 
application number of the application: Japan, No. Hei 8-185216 

(7) Priority date: July 15, 1996 

(8) Number of sequences: 27 



SEQ ID NO: 1
SEQUENCE LENGTH: 354
SEQUENCE TYPE: amino acid
TOPOLOGY: linear
MOLECULE TYPE: protein
ORIGINAL SOURCE:

ORGANISM: Homo sapiens
TISSUE TYPE: lung

SEQUENCE DESCRIPTION:

Met Tyr Arg Glu Trp Val Val Val Asn Val Phe Met Met Leu Tyr Val
1 5 10 15

Gln Leu Val Gln Gly Ser Ser Asn Glu His Gly Pro Val Lys Arg Ser
20 25 30

Ser Gln Ser Thr Leu Glu Arg Ser Glu Gln Gln Ile Arg Ala Ala Ser
35 40 45

Ser Leu Glu Glu Leu Leu Arg Ile Thr His Ser Glu Asp Trp Lys Leu
50 55 60

Trp Arg Cys Arg Leu Arg Leu Lys Ser Phe Thr Ser Met Asp Ser Arg
65 70 75 80

Ser Ala Ser His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Ile
85 90 95

Glu Thr Leu Lys Val Ile Asp Glu Glu Trp Gln Arg Thr Gln Cys Ser
100 105 110

Pro Arg Glu Thr Cys Val Glu Val Ala Ser Glu Leu Gly Lys Ser Thr
115 120 125

Asn Thr Phe Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly
130 135 140

Cys Cys Asn Glu Glu Ser Leu Ile Cys Met Asn Thr Ser Thr Ser Tyr
145 150 155 160

Ile Ser Lys Gln Leu Phe Glu Ile Ser Val Pro Leu Thr Ser Val Pro
165 170 175

Glu Leu Val Pro Val Lys Val Ala Asn His Thr Gly Cys Lys Cys Leu
180 185 190

Pro Thr Ala Pro Arg His Pro Tyr Ser Ile Ile Arg Arg Ser Ile Gln
195 200 205

Ile Pro Glu Glu Asp Arg Cys Ser His Ser Lys Lys Leu Cys Pro Ile
210 215 220

Asp Met Leu Trp Asp Ser Asn Lys Cys Lys Cys Val Leu Gln Glu Glu
225 230 235 240

Asn Pro Leu Ala Gly Thr Glu Asp His Ser His Leu Gln Glu Pro Ala
245 250 255

Leu Cys Gly Pro His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val
260 265 270

Cys Lys Thr Pro Cys Pro Lys Asp Leu Ile Gln His Pro Lys Asn Cys
275 280 285

Ser Cys Phe Glu Cys Lys Glu Ser Leu Glu Thr Cys Cys Gln Lys His
290 295 300

Lys Leu Phe His Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe
305 310 315 320

His Thr Arg Pro Cys Ala Ser Gly Lys Thr Ala Cys Ala Lys His Cys
325 330 335

Arg Phe Pro Lys Glu Lys Arg Ala Ala Gln Gly Pro His Ser Arg Lys
340 345 350

Asn Pro



SEQ ID NO: 2
SEQUENCE LENGTH: 2004
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
ORIGINAL SOURCE:

ORGANISM: Homo sapiens
TISSUE TYPE: lung
FEATURE:

NAME/KEY: CDS
LOCATION: 403..1464
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION:

CCAGCTTTCT GTARCTGTAA GCATTGGTGG CCACACCACC TCCTTACAAA GCAACTAGAA 60
CCTGCGGCAT ACATTGGAGA GATTTTTTTA ATTTTCTGGA CAYGAAGTAA ATTTAGAGTG 120
CTTTCYAATT TCAGGTAGAA GACATGTCCA CCTTCTGATT ATTTTTGGAG AACATTTTGA 180
TTTTTTTCAT CTCTCTCTCC CCACCCCTAA GATTGTGCAA AAAAAGCGTA CCTTGCCXAA 240
TTGAAATAAT TTCATTGGAT TTTGATCAGA ACTGATCATT TGGTTTTCTG TGTGAAGTTT 300
TGAGGTTTCA AACTTTCCTT CTGGAGAATG CCTTTTGAAA CAATTTTCTC TAGCTGCCTG 360
ATGTCAACTG CTTAGTAATC AGTGGATATT GAAATATTCA AA ATG TAC AGA GAG 414
Met Tyr Arg Glu
1

TGG GTA GTG GTG AAT GTT TTC ATG ATG TTG TAC GTC CAG CTG GTG CAG 462
Trp Val Val Val Asn Val Phe Met Met Leu Tyr Val Gln Leu Val Gln
5 10 15 20

GGC TCC AGT AAT GAA CAT GGA CCA GTG AAG CGA TCA TCT CAG TCC ACA 510
Gly Ser Ser Asn Glu His Gly Pro Val Lys Arg Ser Ser Gln Ser Thr
25 30 35

TTG GAA CGA TCT GAA CAG CAG ATC AGG GCT GCT TCT AGT TTG GAG GAA 558
Leu Glu Arg Ser Glu Gln Gln Ile Arg Ala Ala Ser Ser Leu Glu Glu
40 45 50

CTA CTT CGA ATT ACT CAC TCT GAG GAC TGG AAG CTG TGG AGA TGC AGG 606
Leu Leu Arg Ile Thr His Ser Glu Asp Trp Lys Leu Trp Arg Cys Arg
55 60 65

CTG AGG CTC AAA AGT TTT ACC AGT ATG GAC TCT CGC TCA GCA TCC CAT 654
Leu Arg Leu Lys Ser Phe Thr Ser Met Asp Ser Arg Ser Ala Ser His
70 75 80

CGG TCC ACT AGG TTT GCG GCA ACT TTC TAT GAC ATT GAA ACA CTA AAA 702
Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Ile Glu Thr Leu Lys
85 90 95 100

GTT ATA GAT GAA GAA TGG CAA AGA ACT CAG TGC AGC CCT AGA GAA ACG 750
Val Ile Asp Glu Glu Trp Gln Arg Thr Gln Cys Ser Pro Arg Glu Thr
105 110 115

TGC GTG GAG GTG GCC AGT GAG CTG GGG AAG AGT ACC AAC ACA TTC TTC 798
Cys Val Glu Val Ala Ser Glu Leu Gly Lys Ser Thr Asn Thr Phe Phe
120 125 130

AAG CCC CCT TGT GTG AAC GTG TTC CGA TGT GGT GGC TGT TGC AAT GAA 846
Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly Cys Cys Asn Glu
135 140 145

GAG AGC CTT ATC TGT ATG AAC ACC AGC ACC TCG TAC ATT TCC AAA CAG 894
Glu Ser Leu Ile Cys Met Asn Thr Ser Thr Ser Tyr Ile Ser Lys Gln
150 155 160

CTC TTT GAG ATA TCA GTG CCT TTG ACA TCA GTA CCT GAA TTA GTG CCT 942
Leu Phe Glu Ile Ser Val Pro Leu Thr Ser Val Pro Glu Leu Val Pro
165 170 175 180

GTT AAA GTT GCC AAT CAT ACA GGT TGT AAG TGC TTG CCA ACA GCC CCC 990
Val Lys Val Ala Asn His Thr Gly Cys Lys Cys Leu Pro Thr Ala Pro
185 190 195

CGC CAT CCA TAC TCA ATT ATC AGA AGA TCC ATC CAG ATC CCT GAA GAA 1038
Arg His Pro Tyr Ser Ile Ile Arg Arg Ser Ile Gln Ile Pro Glu Glu
200 205 210

GAT CGC TGT TCC CAT TCC AAG AAA CTC TGT CCT ATT GAC ATG CTA TGG 1086
Asp Arg Cys Ser His Ser Lys Lys Leu Cys Pro Ile Asp Met Leu Trp
215 220 225

GAT AGC AAC AAA TGT AAA TGT GTT TTG CAG GAG GAA AAT CCA CTT GCT 1134
Asp Ser Asn Lys Cys Lys Cys Val Leu Gln Glu Glu Asn Pro Leu Ala
230 235 240

GGA ACA GAA GAC CAC TCT CAT CTC CAG GAA CCA GCT CTC TGT GGG CCA 1182
Gly Thr Glu Asp His Ser His Leu Gln Glu Pro Ala Leu Cys Gly Pro
245 250 255 260

CAC ATG ATG TTT GAC GAA GAT CGT TGC GAG TGT GTC TGT AAA ACA CCA 1230
His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val Cys Lys Thr Pro
265 270 275

TGT CCC AAA GAT CTA ATC CAG CAC CCC AAA AAC TGC AGT TGC TTT GAG 1278
Cys Pro Lys Asp Leu Ile Gln His Pro Lys Asn Cys Ser Cys Phe Glu
280 285 290

TGC AAA GAA AGT CTG GAG ACC TGC TGC CAG AAG CAC AAG CTA TTT CAC 1326
Cys Lys Glu Ser Leu Glu Thr Cys Cys Gln Lys His Lys Leu Phe His
295 300 305

CCA GAC ACC TGC AGC TGT GAG GAC AGA TGC CCC TTT CAT ACC AGA CCA 1374
Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe His Thr Arg Pro
310 315 320

TGT GCA AGT GGC AAA ACA GCA TGT GCA AAG CAT TGC CGC TTT CCA AAG 1422
Cys Ala Ser Gly Lys Thr Ala Cys Ala Lys His Cys Arg Phe Pro Lys
325 330 335 340

GAG AAA AGG GCT GCC CAG GGG CCC CAC AGC CGA AAG AAT CCT 1464
Glu Lys Arg Ala Ala Gln Gly Pro His Ser Arg Lys Asn Pro
345 350

TGATTCAGCG TTCCAAGTTC CCCATCCCTG TCATTTTTAA CAGCATGCTG CTTTGCCAAG 1524
TTGCTGTCAC TGTTTTTTTC CCAGGTGTTA AAAAAAAAAT CCATTTTACA CAGCACCACA 1584
GTGAATCCAG ACCAACCTTC CATTCACACC AGCTAAGGAG TCCCTGGTTC ATTGATGGAT 1644
GTCTTCTAGC TGCAGATGCC TCTGCGCACC AAGGAATGCA GAGGAGGGGA CCCATGTAAT 1704
CCTTTTGTTT AGTTTTGTTT TTGTTTTTTG GTGAATGAGA AAGGTGTGCT GGTCATGGAA 1764
TGGCAGGTGT CATATGACTG ATTACTCAGA GCAGATGAGG AAAACTGTAG TCTCTGAGTC 1824
CTTTGCTAAT CGCAACTCTT GTGAATTATT CTGATTCTTT TTTATGCAGA ATTTGATTCG 1884
TATGATCAGT ACTGACTTTC TGATTACTGT CCAGCTTATA GTCTTCCAGT TTAATGAACT 1944
ACCATCTGAT GTTTCATATT TAAGTGTATT TAAAGAAAAT AAACACCATT ATTCAAGTCT 2004
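
The CDS feature of SEQ ID NO: 2 (403..1464) can be checked against the 354-residue length stated for SEQ ID NO: 1. The Python sketch below does that arithmetic and translates the first eight codons of the coding region with the standard genetic code. The names `CODON_TABLE` and `translate` are assumptions introduced for this illustration, and the snippet is a consistency check rather than the identification method used in the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


cds_start, cds_end = 403, 1464             # LOCATION of the CDS in SEQ ID NO: 2
cds_length = cds_end - cds_start + 1       # nucleotides in the coding region
residues = cds_length // 3                 # codons, stop codon not included

first_codons = "ATGTACAGAGAGTGGGTAGTGGTG"  # nucleotides 403-426 of SEQ ID NO: 2
peptide = translate(first_codons)
print(cds_length, residues, peptide)
```

The one-letter result corresponds to Met Tyr Arg Glu Trp Val Val Val, the first eight residues of SEQ ID NO: 1.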

SEQ ID NO: 3
SEQUENCE LENGTH: 16
SEQUENCE TYPE: amino acid
TOPOLOGY: linear

MOLECULE TYPE: peptide
ORIGINAL SOURCE:

ORGANISM: Homo sapiens
TISSUE TYPE: lung
SEQUENCE DESCRIPTION:

Cys Gly Pro Asn Lys Glu Leu Asp Glu Asn Thr Cys Gln Cys Val Cys
1 5 10 15

SEQ ID NO: 4
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION: 
AGGGATGGGG AACTTGGAAC GCTGAAT 

SEQ ID NO: 5
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
GATCTAATCC AGCACCCCAA AAACTGC 

SEQ ID NO: 6 
SEQUENCE LENGTH: 27 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCATCCTAAT ACGACTCACT ATAGGGC 

SEQ ID NO: 7 
SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:

CTGGTTCGGC CCAGAACTTG GAACGCTGAA TCA

SEQ ID NO: 8
SEQUENCE LENGTH: 32
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA

SEQUENCE DESCRIPTION:

CTCGCTCGCC CACTAATACG ACTCACTATA GG

SEQ ID NO: 9 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
AATTAACCCT CACTAAAGGG

SEQ ID NO: 10 
SEQUENCE LENGTH: 22 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CCAGGGTTTT CCCAGTCACG AC

SEQ ID NO: 11
SEQUENCE LENGTH: 23 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
ACTCACTATA GGGCTCGAGC GGC 

SEQ ID NO: 12
SEQUENCE LENGTH: 17
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
AAGTCTGGAG ACCTGCT

SEQ ID NO: 13
SEQUENCE LENGTH: 17
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 

CAGCAGGTCT CCAGACT 17 



SEQ ID NO: 14
SEQUENCE LENGTH: 17
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:

CGCACCCAAG GAATGGA 17

SEQ ID NO: 15
SEQUENCE LENGTH: 18
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:

TGACACCTGG CCATTCCA 18

SEQ ID NO: 16
SEQUENCE LENGTH: 18
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
CATCAGATGG TAGTTCAT

SEQ ID NO: 17 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
ATGCTGAGCG AGAGTCCATA

SEQ ID NO: 18 
SEQUENCE LENGTH: 20 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CACTAGGTTT GCGGCAACTT

SEQ ID NO: 19
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
GCTGTTGGCA AGCACTTACA

SEQ ID NO: 20
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear

MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:
GATCCATCCA GATCCCTGAA

SEQ ID NO: 21
SEQUENCE LENGTH: 19 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 
SEQUENCE DESCRIPTION: 
CAGATCAGGG CTGCTTCTA 

SEQ ID NO: 22 
SEQUENCE LENGTH: 32 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA

SEQUENCE DESCRIPTION:

TCCAGATCTT TTGCGGCAAC TTTCTATGAC AT

SEQ ID NO: 23 

SEQUENCE LENGTH: 33 
SEQUENCE TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULE TYPE: other nucleic acid, synthetic DNA 

SEQUENCE DESCRIPTION: 

CAGGTCGACT CAAACAGGCA CTAATTCAGG TAC 

SEQ ID NO: 24
SEQUENCE LENGTH: 1581
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
ORIGINAL SOURCE:

ORGANISM: mouse
TISSUE TYPE: lung
FEATURE:

NAME/KEY: CDS
LOCATION: 96..1169
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION:

TTCCGGGCTT TGCTGGAGAA TGCCTTTTGC AACACTTTTC AGTAGCTGCC TGGAAACAAC 60
TGCTTAGTCA TCGGTAGACA TTTAAAATAT TCAAA ATG TAT GGA GAA TGG GGA 113
Met Tyr Gly Glu Trp Gly
1 5

ATG GGG AAT ATC CTC ATG ATG TTC CAT GTG TAC TTG GTG CAG GGC TTC 161
Met Gly Asn Ile Leu Met Met Phe His Val Tyr Leu Val Gln Gly Phe
10 15 20

AGG AGC GAA CAT GGA CCA GTG AAG GAT TTT TCT TTT GAG CGA TCA TCC 209
Arg Ser Glu His Gly Pro Val Lys Asp Phe Ser Phe Glu Arg Ser Ser
25 30 35

CGG TCC ATG TTG GAA CGA TCT GAA CAA CAG ATC CGA GCA GCT TCT AGT 257
Arg Ser Met Leu Glu Arg Ser Glu Gln Gln Ile Arg Ala Ala Ser Ser
40 45 50

TTG GAG GAG TTG CTG CAA ATC GCG CAC TCT GAG GAC TGG AAG CTG TGG 305
Leu Glu Glu Leu Leu Gln Ile Ala His Ser Glu Asp Trp Lys Leu Trp
55 60 65 70

CGA TGC CGG TTG AAG CTC AAA AGT CTT GCC AGT ATG GAC TCA CGC TCA 353
Arg Cys Arg Leu Lys Leu Lys Ser Leu Ala Ser Met Asp Ser Arg Ser
75 80 85

GCA TCC CAT CGC TCC ACC AGA TTT GCG GCA ACT TTC TAT GAC ACT GAA 401
Ala Ser His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Thr Glu
90 95 100

ACA CTA AAA GTT ATA GAT GAA GAA TGG CAG AGG ACC CAA TGC AGC CCT 449
Thr Leu Lys Val Ile Asp Glu Glu Trp Gln Arg Thr Gln Cys Ser Pro
105 110 115

AGA GAG ACA TGC GTA GAA GTC GCC AGT GAG CTG GGG AAG ACA ACC AAC 497
Arg Glu Thr Cys Val Glu Val Ala Ser Glu Leu Gly Lys Thr Thr Asn
120 125 130

ACA TTC TTC AAG CCC CCC TGT GTA AAT GTC TTC CGG TGT GGA GGC TGC 545
Thr Phe Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly Cys
135 140 145 150

TGC AAC GAA GAG GGT GTG ATG TGT ATG AAC ACA AGC ACC TCC TAC ATC 593
Cys Asn Glu Glu Gly Val Met Cys Met Asn Thr Ser Thr Ser Tyr Ile
155 160 165

TCC AAA CAG CTC TTT GAG ATA TCA GTG CCT CTG ACA TCA GTG CCC GAG 641
Ser Lys Gln Leu Phe Glu Ile Ser Val Pro Leu Thr Ser Val Pro Glu
170 175 180

TTA GTG CCT GTT AAA ATT GCC AAC CAT ACG GGT TGT AAG TGC TTG CCC 689
Leu Val Pro Val Lys Ile Ala Asn His Thr Gly Cys Lys Cys Leu Pro
185 190 195

ACG GGC CCC CGC CAT CCT TAC TCA ATT ATC AGA AGA TCC ATT CAG ACC 737
Thr Gly Pro Arg His Pro Tyr Ser Ile Ile Arg Arg Ser Ile Gln Thr
200 205 210

CCA GAA GAA GAT GAA TGT CCT CAT TCC AAG AAA CTC TGT CCT ATT GAC 785
Pro Glu Glu Asp Glu Cys Pro His Ser Lys Lys Leu Cys Pro Ile Asp
215 220 225 230

ATG CTG TGG GAT AAC ACC AAA TGT AAA TGT GTT TTG CAA GAC GAG ACT 833
Met Leu Trp Asp Asn Thr Lys Cys Lys Cys Val Leu Gln Asp Glu Thr
235 240 245

CCA CTG CCT GGG ACA GAA GAC CAC TCT TAC CTC CAG GAA CCC ACT CTC 881
Pro Leu Pro Gly Thr Glu Asp His Ser Tyr Leu Gln Glu Pro Thr Leu
250 255 260

TGT GGA CCG CAC ATG ACG TTT GAT GAA GAT CGC TGT GAG TGC GTC TGT 929
Cys Gly Pro His Met Thr Phe Asp Glu Asp Arg Cys Glu Cys Val Cys
265 270 275

AAA GCA CCA TGT CCG GGA GAT CTC ATT CAG CAC CCG GAA AAC TGC AGT 977
Lys Ala Pro Cys Pro Gly Asp Leu Ile Gln His Pro Glu Asn Cys Ser
280 285 290
TGC TTT GAG TGC AAA GAA AGT CTG GAG AGC TGC TGC CAA AAG CAC AAG 1025
Cys Phe Glu Cys Lys Glu Ser Leu Glu Ser Cys Cys Gln Lys His Lys
295 300 305 310

ATT TTT CAC CCA GAC ACC TGC AGC TGT GAG GAC AGA TGT CCT TTT CAC 1073
Ile Phe His Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe His
315 320 325

ACC AGA ACA TGT GCA AGT AGA AAG CCA GCC TGT GGA AAG CAC TGG CGC 1121
Thr Arg Thr Cys Ala Ser Arg Lys Pro Ala Cys Gly Lys His Trp Arg
330 335 340

TTT CCA AAG GAG ACA AGG GCC CAG GGA CTC TAC AGC CAG GAG AAC CCT 1169
Phe Pro Lys Glu Thr Arg Ala Gln Gly Leu Tyr Ser Gln Glu Asn Pro
345 350 355

TGATTCAACT TCCTTTCAAG TCCCCCCATC TCTGTCATTT TAAACAGCTC ACTGCTTTGT 1229
CAAGTTGCTG TCACTGTTGC CCACTACCCC TGCCCCCCCC CCCCCCCGCC TCCAGGTGTT 1289
AGAAAAGTTG ATTTGACCTA GTGTCATGGT AAAGCCACAT TTCCATGCAA TGGCGGCTAG 1349
GTGATTCCCC AGTTCACTGA CAAATGACTT GTAGCTTCAA ATGTCTTTGC GCCATCANCA 1409
CTCAAAAAGG AAGGGGTCTG AAGAACCCCT TGTTTGATAA ATAAAAACAG GTGCCTGAAA 1469
CAAAATATTA GGTGCCACTC GATTGGGTCC CTCGGGCTGG CCAAATTCCA AGGGCAATGC 1529
TCCTGAATTT ATTGTGCCCC TTCCTTAATG CGGAATTTCC TTTTGTTTGA TT 1581

SEQ ID NO: 25
SEQUENCE LENGTH: 1491
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: linear
MOLECULE TYPE: cDNA to mRNA
ORIGINAL SOURCE:

ORGANISM: rat
TISSUE TYPE: lung
FEATURE:

NAME/KEY: CDS
LOCATION: 270..1247
IDENTIFICATION METHOD: E
SEQUENCE DESCRIPTION:

GCCACCTCTT GATTATTTGT GCAGCGGGAA ACTTTGAAAT ACTTTTCATC TCTTTCTCCC 60
ATACTAAGAT TGTGTGTGGC CGTGGGGGAG TCCTTGACTA ACTCAAGTCA TTTCATTGGA 120
TTTTGATTAC AACTGATCAT GTGATATTTT TTTCCATGTA AAGTTTTGGG GCTTCAAACT 180
TTGCTTCTGG AGAATGCCTT TTGCAACACT TTTCAGTAGC TGCCTGGAAA CAACTGCTTA 240
GCCATCAGTG GACATTTGAA ATATTCAAA ATG TAT GGA GAG TGG GCC GCA GTG 293
Met Tyr Gly Glu Trp Ala Ala Val
1 5

AAT ATT CTC ATG ATG TCC TAT GTG TAC CTG GTG CAG GGC TTC AGT ATT 341
Asn Ile Leu Met Met Ser Tyr Val Tyr Leu Val Gln Gly Phe Ser Ile
10 15 20

GAA CAC CGA GCA GTG AAG GAT GTT TCT CTT GAG CGA TCA TCC CGG TCT 389
Glu His Arg Ala Val Lys Asp Val Ser Leu Glu Arg Ser Ser Arg Ser
25 30 35 40

GTG TTG GAA CGT TCT GAA CAA CAG ATC CGC GCG GCT TCT ACT TTG GAA 437
Val Leu Glu Arg Ser Glu Gln Gln Ile Arg Ala Ala Ser Thr Leu Glu
45 50 55

GAG TTG CTG CAA GTC GCA CAC TCT GAG GAC TGG AAG CTG TGG CGG TGC 485
Glu Leu Leu Gln Val Ala His Ser Glu Asp Trp Lys Leu Trp Arg Cys
60 65 70

CGG TTG AAG CTT AAA AGT CTT GCC AAT GTG GAC TCG CGC TCA ACA TCC 533
Arg Leu Lys Leu Lys Ser Leu Ala Asn Val Asp Ser Arg Ser Thr Ser
75 80 85

CAT CGC TCC ACC AGA TTT GCG GCA ACT TTC TAT GAT ACT GAA ACA CTA 581
His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Thr Glu Thr Leu
90 95 100

AAA GTT ATA GAT GAA GAA TGG CAG AGG ACC CAA TGC AGC CCT AGA GAG 629
Lys Val Ile Asp Glu Glu Trp Gln Arg Thr Gln Cys Ser Pro Arg Glu
105 110 115 120

ACA TGC GTA GAA GTC GCC AGT GAG CTG GGG AAG ACA ACC AAC ACA TTT 677
Thr Cys Val Glu Val Ala Ser Glu Leu Gly Lys Thr Thr Asn Thr Phe
125 130 135

TTC AAG CCC CCT TGT GTA AAT GTC TTC CGG TGT GGA GGA TGC TGC AAT 725
Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly Cys Cys Asn
140 145 150

GAA GAG AGC GTG ATG TGT ATG AAC ACA AGC ACC TCC TAC ATC TCC AAA 773
Glu Glu Ser Val Met Cys Met Asn Thr Ser Thr Ser Tyr Ile Ser Lys
155 160 165

CAG CTC TTT GAG ATA TCA GTG CCT CTG ACA TCA GTG CCC GAG TTA GTG 821






Gln Leu Phe Glu Ile Ser Val Pro Leu Thr Ser Val Pro Glu Leu Val
170 175 180

CCT GTT AAA ATT GCC AAC CAT ACG GGT TGT AAG TGT TTG CCC ACG GGC 869
Pro Val Lys Ile Ala Asn His Thr Gly Cys Lys Cys Leu Pro Thr Gly
185 190 195 200

CCC CGG CAT CCT TAT TCA ATT ATC AGA AGA TCC ATT CAG ATC CCA GAA 917
Pro Arg His Pro Tyr Ser Ile Ile Arg Arg Ser Ile Gln Ile Pro Glu
205 210 215

GAA GAT CAA TGT CCT CAT TCC AAG AAA CTC TGT CCT GTT GAC ATG CTG 965
Glu Asp Gln Cys Pro His Ser Lys Lys Leu Cys Pro Val Asp Met Leu
220 225 230

TGG GAT AAC ACC AAA TGT AAA TGT GTT TTA CAA GAT GAG AAT CCA CTG 1013
Trp Asp Asn Thr Lys Cys Lys Cys Val Leu Gln Asp Glu Asn Pro Leu
235 240 245

CCT GGG ACA GAA GAC CAC TCT TAC CTC CAG GAA CCC GCT CTC TGT GGA 1061
Pro Gly Thr Glu Asp His Ser Tyr Leu Gln Glu Pro Ala Leu Cys Gly
250 255 260

CCA CAC ATG ATG TTT GAT GAA GAT CGC TGC GAG TGT GTC TGT AAA GCA 1109
Pro His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val Cys Lys Ala
265 270 275 280

CCA TGT CCT GGA GAT CTC ATT CAG CAC CCG GAA AAC TGC AGT TGC TTT 1157
Pro Cys Pro Gly Asp Leu Ile Gln His Pro Glu Asn Cys Ser Cys Phe
285 290 295

GAA TGC AAA GAA AGT CTG GAA AGC TGT TGC CAA AAG CAC AAG ATG TTT 1205
Glu Cys Lys Glu Ser Leu Glu Ser Cys Cys Gln Lys His Lys Met Phe
300 305 310

CAC CCT GAC ACC TGC AGA TCA ATG GTC TTT TCA CTG TCC CCT 1247
His Pro Asp Thr Cys Arg Ser Met Val Phe Ser Leu Ser Pro
315 320 325

TAATTTGGTT TACTGGTGAC ATTTAAAGGA CATACTAACC TGATTTATTG GGGCTCTTTT 1307
CTCTCAGGGC CCAAGCACAC TCTTAAAGGA ACACAGACGT TTGGCCTCTA AGAAATACAT 1367
GGAAGTATTA TAGAGTGATG ATTAAATTGT CTTCTTGTTT CAAACAGGGT CTCATGATTA 1427
CAGACCCGTA TTGCCATGCC TGCCGTCATG CTATCATGAG CGGAAAAGAA TCACTGGCAT 1487
TTAA 1491
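
Example 8 notes that the first rat clone lacked a termination codon, so it is worth confirming that the CDS listed for SEQ ID NO: 25 (270..1247) ends immediately before an in-frame stop. The Python sketch below scans the last coding line of the listing (nucleotides 1206-1247) together with the first ten nucleotides of the 3' untranslated region. The helper name `first_stop_offset` and the choice of window are assumptions made for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_offset(seq: str) -> int:
    """Return the 0-based offset of the first in-frame stop codon, or -1 if none."""
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return i
    return -1


cds_start, cds_end = 270, 1247  # LOCATION of the CDS in SEQ ID NO: 25
tail_start = 1206               # position of the first nucleotide in `tail`
tail = "CACCCTGACACCTGCAGATCAATGGTCTTTTCACTGTCCCCT" + "TAATTTGGTT"

codons = (cds_end - cds_start + 1) // 3
stop_position = tail_start + first_stop_offset(tail)
print(codons, stop_position)
```

The stop codon found this way begins one nucleotide after the listed end of the CDS, and the codon count agrees with the numbering of the last residue (Pro 326).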

SEQ ID NO: 26
SEQUENCE LENGTH: 20
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:

GCTGCGAGTG TGTCTGTAAA 20

SEQ ID NO: 27
SEQUENCE LENGTH: 25
SEQUENCE TYPE: nucleic acid
STRANDEDNESS: single
TOPOLOGY: linear
MOLECULE TYPE: other nucleic acid, synthetic DNA
SEQUENCE DESCRIPTION:

GGGTAGTGGG CAACAGTGAC AGCAA 25


Claims 

1. A protein shown by SEQ ID NO: 1 or having the amino acid sequence derived therefrom in which one or more
amino acids are substituted, deleted, or added.

2. A protein encoded by a DNA hybridizing with the DNA shown by SEQ ID NO: 2.

3. A DNA encoding the protein of Claim 1.

4. A DNA hybridizing with the DNA shown by SEQ ID NO: 2.

5. A vector containing the DNA of Claim 3 or 4.

6. A transformant carrying the vector of Claim 5.

7. A method of producing the protein of Claim 1 or 2, which comprises culturing the transformant of Claim 6.

8. An antibody binding to the protein of Claim 1 or 2.

9. A method of screening a compound binding to the protein of Claim 1 or 2, which comprises a step of detecting the
activity of the protein of Claim 1 or 2 to bind to a test sample.

10. A compound binding to the protein of Claim 1 or 2, wherein the compound has been isolated by the method of
Claim 9.

Fig. 1

Fig. 2

Fig. 3

Fig. 4

a) Hydrophobicity

b) Prediction of the human VEGF-D signal peptide